Ranking Classi cation Algorithms with Dataset Selection: Using Accuracy and Time Results
نویسندگان
چکیده
Given that wide variety of available classiication algorithms exists, the selection of the right algorithm to use on a new problem is an important issue. In this paper we present zooming, that analyzes a given dataset and selects relevant (similar) datasets used in the past. This process is based on the \distance" calculated on the basis of several dataset characteristics. The accuracy and time results associated with the selected datasets are then processed to generate an advice in the form of a ranking, indicating which algorithms should be applied in which order. Here we propose the adjusted ratio of ratios ranking method. The generalization power of this ranking method is analyzed and the experimental results indicate that zooming leads to better results on average. The work presented can be seen as a rst step towards a system to provide advice on the utility of diierent solution strategies.
منابع مشابه
Ranking Classi cation Algorithms Based on Relevant Performance Information
Given the wide variety of available classiication algorithms and the volume of data today's organizations need to analyze, the selection of the right algorithm to use on a new problem is an important issue. In this paper we present zooming, a technique that, for a given dataset, selects relevant past performance information. The selection process is based on the distance between the dataset at ...
متن کاملAttribute bagging: improving accuracy of classifier ensembles by using random feature subsets
We present attribute bagging (AB), a technique for improving the accuracy and stability of classi#er ensembles induced using random subsets of features. AB is a wrapper method that can be used with any learning algorithm. It establishes an appropriate attribute subset size and then randomly selects subsets of features, creating projections of the training set on which the ensemble classi#ers ar...
متن کاملFeature selection using Fuzzy Entropy measures with Yu ' s Similarity measure
In this study, feature selection in classi cation based problems is highlighted. The role of feature selection methods is to select important features by discarding redundant and irrelevant features in the data set, we investigated this case by using fuzzy entropy measures. We developed fuzzy entropy based feature selection method using Yu's similarity and test this using similarity classi er. ...
متن کاملFeature Selection: Evaluation, Application, and Small Sample Performance
A large number of algorithms have been proposed for feature subset selection. Our experimental results show that the sequential forward oating selection (SFFS) algorithm, proposed by Pudil et al., dominates the other algorithms tested. We study the problem of choosing an optimal feature set for land use classi cation based on SAR satellite images using four di erent texture models. Pooling feat...
متن کاملDeveloping a Filter-Wrapper Feature Selection Method and its Application in Dimension Reduction of Gen Expression
Nowadays, increasing the volume of data and the number of attributes in the dataset has reduced the accuracy of the learning algorithm and the computational complexity. A dimensionality reduction method is a feature selection method, which is done through filtering and wrapping. The wrapper methods are more accurate than filter ones but perform faster and have a less computational burden. With ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007